In the Claims 

For the convenience of the Examiner, all pending claims are set forth below, whether 
or not an amendment is made. Please amend the claims as follows: 

1. (Previously Presented) A system for identifying relationships between 
database records, comprising: 

a memory operable to store a plurality of records comprising a first record and at least 
one second record, each record comprising at least one of a plurality of tokens; and 

one or more processors collectively operable to: 

determine a weight associated with each of the tokens; 

generate a correlithm object associated with at least one of the tokens, the 
correlithm object comprising a plurality of values defining a first point in a particular space, 
the particular space defined by a plurality of dimensions and including a plurality of points; 

generate a significance vector associated with the correlithm object; 

compare at least one second record to the first record; and 

determine at least one relationship indicator based on the comparison and at 
least one of the weights, the at least one relationship indicator identifying a level of 
relationship between the first record and at least one second record. 

2. (Original) The system of Claim 1, wherein the weight associated with one of 
the tokens is inversely proportional to a number of times that the token appears in the 
plurality of records. 

3. (Original) The system of Claim 1, wherein the one or more processors are 
collectively operable to determine the weight associated with one of the tokens using a 
formula of: 

where $\text{Count}_{\text{Token}}$ represents a number of times that the token appears in the plurality of 
records, and $\text{Total}_{\text{Tokens}}$ represents a number of times that all tokens appear in the plurality of 
records. 

4. (Previously Presented) The system of Claim 1, wherein the one or more 
processors are collectively operable to compare one of the second records to the first record 
by: 

identifying any common tokens, a common token comprising one of the tokens that 
appears in both the first record and the second record; and 

identifying a common count value for each common token, the common count value 
representing a minimum number of times that the common token appears in the first record 
and the second record. 
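
The comparison recited in Claim 4 can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: each record is modeled as a list of token strings, and the function name `common_counts` and the sample records are hypothetical and do not appear in the claims.

```python
from collections import Counter


def common_counts(first_record, second_record):
    """Return {token: common count} for tokens present in both records.

    Each record is assumed to be a list of token strings. The common count
    is the smaller of the two per-record occurrence counts.
    """
    first = Counter(first_record)
    second = Counter(second_record)
    # Counter intersection keeps only shared tokens, each with its minimum count.
    return dict(first & second)


# Hypothetical sample records.
first = ["patent", "claim", "claim", "token"]
second = ["claim", "claim", "claim", "record", "token"]
print(common_counts(first, second))
```

Here "claim" appears twice in the first record and three times in the second, so its common count is 2; "patent" and "record" are not common tokens and are omitted.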

5. (Original) The system of Claim 1, wherein the relationship indicator 
associated with one of the second records when compared to the first record is determined 
using a formula of: 

\[
\text{Relationship Indicator} = \frac{\sum_{i=1}^{j} \left( \text{Weight}_{\text{Token}_i} \times \text{Common Count}_{\text{Token}_i} \right)}{\text{Score}_{\text{Target Record}}}
\]

where j represents a number of unique common tokens that appear in both the first record and 
the second record, $\text{Weight}_{\text{Token}_i}$ represents the weight associated with the ith common token, 
$\text{Common Count}_{\text{Token}_i}$ represents a minimum number of times that the ith common token 
appears in either the first record or the second record, and $\text{Score}_{\text{First Record}}$ represents a record 
score associated with the first record. 

6. (Original) The system of Claim 5, wherein the record score associated with 
the first record is determined using a formula of: 

\[
\text{Record Score} = \sum_{i=1}^{k} \left( \text{Weight}_{\text{Token}_k} \times \text{Count}_{\text{Token}_k} \right)
\]

where k represents a number of unique tokens associated with the first record, $\text{Weight}_{\text{Token}_k}$ 
represents the weight associated with the kth unique token, and $\text{Count}_{\text{Token}_k}$ represents a 
number of times that the kth unique token appears in the first record. 
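
The formulas of Claims 5 and 6 can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names, the sample records, and the weight values are hypothetical; records are modeled as lists of token strings; and the first record is assumed to be non-empty so that its record score is positive.

```python
from collections import Counter


def record_score(record, weights):
    """Sum of weight * count over the unique tokens of one record."""
    counts = Counter(record)
    return sum(weights[token] * n for token, n in counts.items())


def relationship_indicator(first_record, second_record, weights):
    """Weighted common-token sum divided by the first record's score."""
    # Counter intersection yields the minimum count of each common token.
    common = Counter(first_record) & Counter(second_record)
    numerator = sum(weights[token] * n for token, n in common.items())
    return numerator / record_score(first_record, weights)


# Hypothetical weights; the claims do not fix these values.
weights = {"patent": 2.0, "claim": 1.0, "token": 4.0, "record": 3.0}
first = ["patent", "claim", "claim", "token"]
second = ["claim", "claim", "claim", "record", "token"]
print(record_score(first, weights))                    # 8.0
print(relationship_indicator(first, second, weights))  # 0.75
```

In this sketch a record compared with itself yields 1.0, and a record sharing no tokens with the first record yields 0.0.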
7. (Original) The system of Claim 1, wherein: 

each of the plurality of records is associated with at least one document; 

the one or more processors are collectively operable to compare a plurality of second 
records to the first record and determine a plurality of relationship indicators; and 

the one or more processors are further collectively operable to: 

select one or more of the second records based on the relationship indicators; and 

make the documents associated with the one or more second records available to a user. 

8. (Original) The system of Claim 7, wherein the one or more processors are 
collectively operable to select the one or more second records based on input from the user. 

9. (Original) The system of Claim 1, wherein the one or more processors are 
collectively operable to allow a user to select the first record, wherein selecting the first 
record comprises at least one of selecting one of the plurality of records and submitting a 
document that the one or more processors may use to generate the first record. 

10. (Original) The system of Claim 1, wherein the one or more processors are 
further collectively operable to generate a plurality of text files, each text file associated with 
one of a plurality of documents and comprising the at least one token contained in the 
associated document. 

11. (Original) The system of Claim 10, wherein the one or more processors are 
collectively operable to generate the plurality of text files by performing at least one of 
optical character recognition and file conversion on each of the documents. 

12. (Original) The system of Claim 10, wherein the one or more processors are 
further collectively operable to generate the plurality of records, each record associated with 
one of the text files and comprising the at least one token contained in the associated text file. 
13. (Original) The system of Claim 12, wherein the one or more processors are 
collectively operable to generate one of the records by: 

identifying one-word tokens in one of the text files, the one-word tokens comprising 
individual words in the text file; 

inserting the one-word tokens into the record; 

selecting pairs of one-word tokens in the record, each pair of one-word tokens 
comprising consecutive one-word tokens in the record; 

combining the pairs of one-word tokens to produce two-word tokens; and 
inserting the two-word tokens into the record. 

14. (Original) The system of Claim 13, wherein the one or more processors are 
further collectively operable to ignore at least one stop word in the text file when identifying 
one-word tokens in one of the text files. 
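
The record generation of Claims 13 and 14 can be sketched in Python as follows. The stop-word list, the lowercasing, the regular expression used to split words, and the function name `build_record` are all assumptions made for illustration; the claims do not specify them.

```python
import re

# Hypothetical stop-word list; the claims do not enumerate stop words.
STOP_WORDS = {"a", "an", "and", "of", "the", "to"}


def build_record(text, stop_words=STOP_WORDS):
    """Return the one-word tokens of a text followed by its two-word tokens."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    # Stop words are ignored when identifying one-word tokens.
    one_word = [w for w in words if w not in stop_words]
    # Pair each one-word token with the token that follows it in the record.
    two_word = [f"{a} {b}" for a, b in zip(one_word, one_word[1:])]
    return one_word + two_word


print(build_record("The weight of the token"))
# ['weight', 'token', 'weight token']
```

Because stop words are dropped before pairing, "weight" and "token" are consecutive in the record and form the two-word token "weight token".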

15. (Original) The system of Claim 12, wherein the one or more processors are 
further collectively operable to: 

replace the tokens in the record with one or more token representations; and 
consolidate the record by ensuring that each unique token or token representation 
appears only once in the record. 

16. (Original) The system of Claim 1, wherein the one or more processors are 
further collectively operable to receive one or more documents using at least one of an 
interface coupled to a network, a drive operable to read at least one computer readable 
medium, and a scanner. 

17. (Original) The system of Claim 1, wherein the one or more processors are 
further collectively operable to: 

receive a query from a user; 

identify one or more records that satisfy the query; 

identify one or more documents associated with the one or more records; and 
make the one or more documents available to the user. 
18. (Original) The system of Claim 1, wherein the one or more processors are 
further collectively operable to: 

generate a token table comprising a plurality of first entries, each first entry 
comprising one of the tokens, a token representation associated with the token, the weight 
associated with the token, and a first count value associated with the token, the first count 
value representing a number of times that the token appears in the plurality of records; 

generate a records table comprising a plurality of second entries, each second entry 
associated with one of the records and comprising one of the token representations and a 
second count value, the token representation in the second entry associated with one of the 
tokens contained in the record, the second count value representing a number of times that 
the token associated with the second entry appears in the record; and 

generate a records table index comprising a plurality of third entries, each third entry 
associated with one of the records and comprising an identification of at least one second 
entry associated with the record and a record score associated with the record. 
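
One possible in-memory layout for the three structures of Claim 18 is sketched below in Python. The layout is an assumption made for illustration: token representations are taken to be small integers, token weights are supplied by the caller, the record score is computed as the sum of weight times count, and the name `build_tables` is hypothetical.

```python
from collections import Counter


def build_tables(records, weights):
    """Build a token table, a records table, and a records table index.

    `records` maps a record id to its list of tokens; `weights` maps a token
    to its weight. Token representations are assumed to be small integers.
    """
    totals = Counter(t for tokens in records.values() for t in tokens)
    # Token table: token -> representation, weight, count across all records.
    token_table = {
        token: {"rep": rep, "weight": weights[token], "count": count}
        for rep, (token, count) in enumerate(sorted(totals.items()))
    }
    records_table = []   # rows of (token representation, count in record)
    records_index = {}   # record id -> row positions and record score
    for record_id, tokens in records.items():
        start = len(records_table)
        score = 0.0
        for token, count in sorted(Counter(tokens).items()):
            records_table.append((token_table[token]["rep"], count))
            score += weights[token] * count
        records_index[record_id] = {
            "rows": range(start, len(records_table)),
            "score": score,
        }
    return token_table, records_table, records_index
```

Each index entry identifies the rows of the records table that belong to one record, so a record can be read back without scanning the whole table.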

19. (Original) The system of Claim 18, wherein the one or more processors are 
further collectively operable to convert at least one of the plurality of records, the token table, 
the records table, and the records table index from a first format to a second format, the 
second format used by an external system. 

20. (Original) The system of Claim 1, wherein the one or more processors are 
further collectively operable to categorize each of the records based at least partially on the 
tokens contained in the records and locations of the tokens in the records. 



21. (Canceled) 
22. (Previously Presented) The system of Claim 1, wherein: 

a distance between the first point and each of the plurality of points in the particular 
space defines a distribution having a standard deviation; and 

a number of values in the correlithm object associated with one of the tokens may be 
determined using a formula of: 

\[
\text{Number of Values} = \left\lceil \text{Weight}_{\text{Token}} \times \text{Standard Deviation} \right\rceil
\]

where $\text{Weight}_{\text{Token}}$ represents the weight associated with the token, and Standard Deviation 
represents the standard deviation of the distribution. 



23. (Canceled) 



24. (Previously Presented) The system of Claim 1, wherein the significance 
vector comprises a plurality of significance values, each significance value determined using 
a formula of: 

\[
\text{Significance Value} = \frac{\text{Weight}_{\text{Token}} \times \text{Standard Deviation}}{\text{Number of Values}}
\]

where $\text{Weight}_{\text{Token}}$ represents the weight associated with the token, Standard Deviation 
represents the standard deviation of the distribution, and Number of Values represents a 
number of values defining the first point in the correlithm object. 
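
The arithmetic of Claims 22 and 24 can be sketched in Python as follows. The sketch assumes that the brackets in the Claim 22 formula denote rounding up to the next integer and that the product of weight and standard deviation is positive; the function names and input values are hypothetical.

```python
import math


def number_of_values(weight, std_dev):
    """Number of values in a token's correlithm object (rounded up)."""
    return math.ceil(weight * std_dev)


def significance_value(weight, std_dev):
    """Weight times standard deviation, divided by the number of values."""
    return weight * std_dev / number_of_values(weight, std_dev)


# Hypothetical inputs chosen only to make the arithmetic easy to follow.
print(number_of_values(2.5, 3.0))    # 8
print(significance_value(2.5, 3.0))  # 0.9375
```

With these inputs the product is 7.5, which rounds up to 8 values, and each significance value is 7.5 / 8 = 0.9375.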

25. (Previously Presented) The system of Claim 1, wherein: 

the correlithm object comprises a first correlithm object, the first correlithm object 
associated with the significance vector comprising a first significance vector; 

the one or more processors are collectively operable to generate a first correlithm 
object and a first significance vector for each of the tokens; and 

the one or more processors are further collectively operable to generate a second 
correlithm object and a second significance vector associated with the first record, the second 
correlithm object comprising at least one of the first correlithm objects. 
26. (Original) The system of Claim 25, wherein: 

the second correlithm object comprises one or more first entries and the second 
significance vector comprises one or more second entries, at least one first entry comprising 
one of the first correlithm objects; and 

a number of first entries in the second correlithm object and a number of second 
entries in the second significance vector are determined using a formula of: 

\[
\text{Number of Entries} = \sum_{i=1}^{j} \left( \text{Maximum Instances}_{\text{Token}_i} \right)
\]

where j represents a number of unique tokens contained in the plurality of records, and 
$\text{Maximum Instances}_{\text{Token}_i}$ represents a maximum number of times that the ith unique token 
appears in a single record in the plurality of records. 
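
The entry count of Claim 26 can be sketched in Python as follows. Records are modeled as lists of token strings, and the function name and sample data are hypothetical.

```python
from collections import Counter


def number_of_entries(records):
    """Sum, over unique tokens, of the largest per-record occurrence count."""
    max_instances = Counter()
    for tokens in records:
        # Counter union keeps the maximum count seen so far for each token.
        max_instances |= Counter(tokens)
    return sum(max_instances.values())


# Hypothetical sample records.
records = [["claim", "claim", "token"], ["claim", "record", "record"]]
print(number_of_entries(records))  # 5
```

Here "claim" and "record" each appear at most twice in a single record and "token" at most once, giving 2 + 2 + 1 = 5 entries.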

27. (Original) The system of Claim 26, wherein: 

each first entry in the second correlithm object is associated with an instance of one 
of the tokens; 

each first entry in the second correlithm object is also associated with one second 
entry in the second significance vector; and 

the one or more processors are collectively operable to generate the second 
significance vector by: 

determining whether the instance of the token associated with one of the first 
entries appears in the first record; 

inserting one or more non-zero significance values into the second entry 
associated with the first entry when the instance of the token associated with the first entry 
appears in the first record; and 

inserting one or more zero significance values into the second entry associated 
with the first entry when the instance of the token associated with the first entry does not 
appear in the first record. 
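
One way to read the generation of the second significance vector in Claim 27 is sketched below in Python. The sketch rests on two assumptions that the claim does not state: the kth instance of a token is treated as appearing in a record when the record contains that token at least k times, and each entry holds a single significance value. All names and values are hypothetical.

```python
from collections import Counter


def instance_significance_vector(record, entries, significance):
    """Build one record's significance vector over (token, instance) entries.

    `entries` lists (token, instance number) pairs, instance numbers starting
    at 1; `significance` maps a token to its significance value.
    """
    counts = Counter(record)
    return [
        # Non-zero when the instance is present in the record, zero otherwise.
        significance[token] if counts[token] >= instance else 0.0
        for token, instance in entries
    ]


entries = [("claim", 1), ("claim", 2), ("token", 1)]
significance = {"claim": 0.5, "token": 0.9}
print(instance_significance_vector(["claim", "token"], entries, significance))
# [0.5, 0.0, 0.9]
```

The record contains "claim" only once, so the entry for the second instance of "claim" receives a zero significance value.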
28. (Original) The system of Claim 27, wherein: 

the one or more processors are collectively operable to generate a second correlithm 
object and a second significance vector for each of the first record and at least one second 
record; and 

the relationship indicator associated with one of the second records when compared to 
the first record is determined using a formula of: 

\[
\text{Relationship Indicator} = \frac{\sum_{i=1}^{N} \left( \text{Overlap}_{AS_i,BS_i} \times \left( \text{Stnd. Dist.}_i^{2} - \sum_{j=1}^{M} \left( A_j - B_j \right)^{2} \right) \right)}{\sum_{i=1}^{N} \left( \text{Overlap}_{AS_i,AS_i} \times \text{Stnd. Dist.}_i^{2} \right)}
\]

where N represents the number of first entries in the second correlithm objects and the 
number of second entries in the second significance vectors, $AS_i$ represents the significance 
values in the ith second entry of the second significance vector associated with the first 
record, $BS_i$ represents the significance values in the ith second entry of the second 
significance vector associated with the second record, $\text{Overlap}_{AS_i,BS_i}$ and $\text{Overlap}_{AS_i,AS_i}$ each 
represents an overlap value between the identified significance values in the second 
significance vectors, $\text{Stnd. Dist.}_i$ represents a standard distance associated with the first 
correlithm objects contained in the ith first entries of the second correlithm objects, M 
represents the number of values in the first correlithm objects contained in the ith first entries 
of the second correlithm objects, $A_j$ represents the jth value of the first correlithm object 
contained in the ith first entry of the second correlithm object associated with the first record, 
and $B_j$ represents the jth value of the first correlithm object contained in the ith first entry of 
the second correlithm object associated with the second record. 

29. (Original) The system of Claim 28, wherein $\text{Overlap}_{AS_i,BS_i}$ and $\text{Overlap}_{AS_i,AS_i}$ 
each comprises one of a minimum of the identified significance values in the second 
significance vectors and a product of the identified significance values in the second 
significance vectors. 
30. (Original) The system of Claim 25, wherein: 

the second correlithm object comprises one or more first entries and the second 
significance vector comprises one or more second entries, at least one first entry comprising 
one of the first correlithm objects; and 

a number of first entries in the second correlithm object and a number of second 
entries in the second significance vector equal a number of unique tokens in the plurality of 
records. 

31. (Original) The system of Claim 30, wherein: 

each first entry in the second correlithm object is associated with one of the unique 
tokens; 

each first entry in the second correlithm object is also associated with one second 
entry in the second significance vector; and 

the one or more processors are collectively operable to generate the second 
significance vector by: 

determining a number of times that the unique token associated with the first 
entry appears in the first record; 

determining a maximum number of times that the unique token associated 
with the first entry appears in a single record in the plurality of records; 

inserting one or more significance values from the first significance vector 
associated with the unique token into the second entry associated with the first entry when 
the unique token associated with the first entry appears the maximum number of times in the 
first record; 

inserting one or more scaled significance values from the first significance 
vector associated with the unique token into the second entry associated with the first entry 
when the unique token associated with the first entry appears at least once but less than the 
maximum number of times in the first record; and 

inserting one or more zero significance values into the second entry associated 
with the first entry when the unique token associated with the first entry does not appear in 
the first record. 
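
The three cases of Claim 31 can be sketched in Python as follows. The claim does not say how the significance values are scaled; the sketch assumes linear scaling by the ratio of the token's count in the record to its maximum count in any single record, and it assumes one significance value per entry. The record being encoded is assumed to be one of the supplied records. All names and values are hypothetical.

```python
from collections import Counter


def scaled_significance_vector(record, records, significance):
    """Build one record's significance vector with one entry per unique token.

    Scaling by count / maximum count is an assumption of this sketch.
    """
    max_counts = Counter()
    for tokens in records:
        # Counter union keeps the maximum per-record count of each token.
        max_counts |= Counter(tokens)
    counts = Counter(record)
    vector = {}
    for token in sorted(max_counts):
        # Full value at the maximum count, zero when absent, scaled between.
        vector[token] = significance[token] * counts[token] / max_counts[token]
    return vector


records = [["claim", "claim", "token"], ["claim", "record"]]
significance = {"claim": 0.5, "token": 0.9, "record": 0.8}
print(scaled_significance_vector(records[1], records, significance))
# {'claim': 0.25, 'record': 0.8, 'token': 0.0}
```

In the second record "claim" appears once against a maximum of two, so its value is halved; "record" appears its maximum number of times and keeps its full value; "token" is absent and receives zero.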
32. (Original) The system of Claim 1, wherein: 

at least one token comprises a first correlithm object; and 

at least one of the records comprises a second correlithm object, the second 
correlithm object comprising at least one of the first correlithm objects. 



DAL01:855441.1 



ATTORNEY'S DOCKET NO. 
066300.0132 



PATENT APPLICATION 
10/081,620 






33. (Previously Presented) A method for identifying relationships between 
database records, comprising: 

determining a weight associated with each of a plurality of tokens, each token 
contained in at least one of a plurality of records, the plurality of records comprising a first 
record and at least one second record; 

generating a correlithm object associated with at least one of the tokens, the 
correlithm object comprising a plurality of values defining a first point in a particular space, 
the particular space defined by a plurality of dimensions and including a plurality of points; 

generating a significance vector associated with the correlithm object; 

comparing at least one second record to the first record; and 

determining at least one relationship indicator based on the comparison and at least 
one of the weights, the at least one relationship indicator identifying a level of relationship 
between the first record and at least one second record. 

34. (Original) The method of Claim 33, wherein the weight associated with one 
of the tokens is determined using a formula of: 



where $\text{Count}_{\text{Token}}$ represents a number of times that the token appears in the plurality of 
records, and $\text{Total}_{\text{Tokens}}$ represents a number of times that all tokens appear in the plurality of 
records. 

35. (Original) The method of Claim 33, wherein comparing one second record to 
the first record comprises: 

identifying any common tokens, a common token comprising one of the tokens that 
appears in both records; and 

identifying a common count value for each common token, the common count value 
representing a minimum number of times that the common token appears in either record. 
36. (Original) The method of Claim 33, wherein the relationship indicator 
associated with one of the second records when compared to the first record is determined 
using a formula of: 

\[
\text{Relationship Indicator} = \frac{\sum_{i=1}^{j} \left( \text{Weight}_{\text{Token}_i} \times \text{Common Count}_{\text{Token}_i} \right)}{\text{Score}_{\text{Target Record}}}
\]

where j represents a number of unique common tokens that appear in both the first record and 
the second record, $\text{Weight}_{\text{Token}_i}$ represents the weight associated with the ith common token, 
$\text{Common Count}_{\text{Token}_i}$ represents a minimum number of times that the ith common token 
appears in either the first record or the second record, and $\text{Score}_{\text{First Record}}$ represents a record 
score associated with the first record. 

37. (Original) The method of Claim 36, wherein the record score associated with 
the first record is determined using a formula of: 

\[
\text{Record Score} = \sum_{i=1}^{k} \left( \text{Weight}_{\text{Token}_k} \times \text{Count}_{\text{Token}_k} \right)
\]

where k represents a number of unique tokens associated with the first record, $\text{Weight}_{\text{Token}_k}$ 
represents the weight associated with the kth unique token, and $\text{Count}_{\text{Token}_k}$ represents a 
number of times that the kth unique token appears in the first record. 

38. (Original) The method of Claim 33, further comprising: 

generating a plurality of text files, each text file associated with one of a plurality of 
documents and comprising the at least one token contained in the associated document; and 

generating the plurality of records, each record associated with one of the text files 
and comprising the at least one token contained in the associated text file. 






39. (Original) The method of Claim 38, wherein generating one of the records 
comprises: 

identifying one-word tokens in one of the text files, the one-word tokens comprising 
individual words in the text file; 

inserting the one-word tokens into the record; 

selecting pairs of one-word tokens in the record, each pair of one-word tokens 
comprising consecutive one-word tokens in the record; 

combining the pairs of one-word tokens to produce two-word tokens; and 
inserting the two-word tokens into the record. 

40. (Original) The method of Claim 33, further comprising: 

generating a token table comprising a plurality of first entries, each first entry 
comprising one of the tokens, a token representation associated with the token, the weight 
associated with the token, and a first count value associated with the token, the first count 
value representing a number of times that the token appears in the plurality of records; 

generating a records table comprising a plurality of second entries, each second entry 
associated with one of the records and comprising one of the token representations and a 
second count value, the token representation in the second entry associated with one of the 
tokens contained in the record, the second count value representing a number of times that 
the token associated with the second entry appears in the record; and 

generating a records table index comprising a plurality of third entries, each third 
entry associated with one of the records and comprising an identification of at least one 
second entry associated with the record and a record score associated with the record. 



41. (Canceled) 
42. (Previously Presented) The method of Claim 33, wherein: 

a distance between the first point and each of the plurality of points in the particular 
space defines a distribution having a standard deviation; and 

a number of values in the correlithm object associated with one of the tokens may be 
determined using a formula of: 

\[
\text{Number of Values} = \left\lceil \text{Weight}_{\text{Token}} \times \text{Standard Deviation} \right\rceil
\]

where $\text{Weight}_{\text{Token}}$ represents the weight associated with the token, and Standard Deviation 
represents the standard deviation of the distribution. 



43. (Canceled) 



44. (Previously Presented) The method of Claim 33, wherein the significance 
vector comprises a plurality of significance values, each significance value determined using 
a formula of: 

\[
\text{Significance Value} = \frac{\text{Weight}_{\text{Token}} \times \text{Standard Deviation}}{\text{Number of Values}}
\]

where $\text{Weight}_{\text{Token}}$ represents the weight associated with the token, Standard Deviation 
represents the standard deviation of the distribution, and Number of Values represents a 
number of values defining the first point in the correlithm object. 

45. (Previously Presented) The method of Claim 33, wherein the correlithm 
object comprises a first correlithm object, the first correlithm object associated with the 
significance vector comprising a first significance vector; 

wherein a first correlithm object and a first significance vector are generated for each 
of the tokens; and 

further comprising generating a second correlithm object and a second significance 
vector associated with the first record, the second correlithm object comprising at least one of 
the first correlithm objects. 
46. (Original) The method of Claim 45, wherein: 

the second correlithm object comprises one or more first entries and the second 
significance vector comprises one or more second entries, at least one first entry comprising 
one of the first correlithm objects; and 

a number of first entries in the second correlithm object and a number of second 
entries in the second significance vector are determined using a formula of: 

\[
\text{Number of Entries} = \sum_{i=1}^{j} \left( \text{Maximum Instances}_{\text{Token}_i} \right)
\]

where j represents a number of unique tokens contained in the plurality of records, and 
$\text{Maximum Instances}_{\text{Token}_i}$ represents a maximum number of times that the ith unique token 
appears in a single record in the plurality of records. 

47. (Original) The method of Claim 46, wherein: 

each first entry in the second correlithm object is associated with an instance of one 
of the tokens; 

each first entry in the second correlithm object is also associated with one second 
entry in the second significance vector; and 

generating the second significance vector comprises: 

determining whether the instance of the token associated with one of the first 
entries appears in the first record; 

inserting one or more non-zero significance values into the second entry 
associated with the first entry when the instance of the token associated with the first entry 
appears in the first record; and 

inserting one or more zero significance values into the second entry associated 
with the first entry when the instance of the token associated with the first entry does not 
appear in the first record. 
48. (Original) The method of Claim 47, wherein: 

a second correlithm object and a second significance vector are generated for each of 
the first record and at least one second record; 

the relationship indicator associated with one of the second records when compared to 
the first record is determined using a formula of: 

\[
\text{Relationship Indicator} = \frac{\sum_{i=1}^{N} \left( \text{Overlap}_{AS_i,BS_i} \times \left( \text{Stnd. Dist.}_i^{2} - \sum_{j=1}^{M} \left( A_j - B_j \right)^{2} \right) \right)}{\sum_{i=1}^{N} \left( \text{Overlap}_{AS_i,AS_i} \times \text{Stnd. Dist.}_i^{2} \right)}
\]

where N represents the number of first entries in the second correlithm objects and the 
number of second entries in the second significance vectors, $AS_i$ represents the significance 
values in the ith second entry of the second significance vector associated with the first 
record, $BS_i$ represents the significance values in the ith second entry of the second 
significance vector associated with the second record, $\text{Overlap}_{AS_i,BS_i}$ and $\text{Overlap}_{AS_i,AS_i}$ each 
represents an overlap value between the identified significance values in the second 
significance vectors, $\text{Stnd. Dist.}_i$ represents a standard distance associated with the first 
correlithm objects contained in the ith first entries of the second correlithm objects, M 
represents the number of values in the first correlithm objects contained in the ith first entries 
of the second correlithm objects, $A_j$ represents the jth value of the first correlithm object 
contained in the ith first entry of the second correlithm object associated with the first record, 
and $B_j$ represents the jth value of the first correlithm object contained in the ith first entry of 
the second correlithm object associated with the second record; and 

$\text{Overlap}_{AS_i,BS_i}$ and $\text{Overlap}_{AS_i,AS_i}$ each comprises one of a minimum of the identified 
significance values in the second significance vectors and a product of the identified 
significance values in the second significance vectors. 
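
The relationship indicator of Claim 48 can be sketched in Python as follows. The sketch reads the formula as a ratio: the numerator sums, over the N entries, the overlap of the two records' significance values multiplied by the squared standard distance less the squared distance between the two correlithm objects; the denominator sums the first record's self-overlap multiplied by the squared standard distance. That reading, the use of a single significance value per entry, and all names and sample values are assumptions made for illustration.

```python
def correlithm_relationship_indicator(entries_a, entries_b, overlap=min):
    """Compare two records entry by entry.

    Each entry is a tuple (significance, standard_distance, values), where
    `values` lists the coordinates of a first correlithm object. The two
    entry lists are assumed to be aligned and of equal length.
    """
    numerator = 0.0
    denominator = 0.0
    for (sig_a, dist, a), (sig_b, _, b) in zip(entries_a, entries_b):
        # Squared distance between the two points, summed over M values.
        squared_gap = sum((x - y) ** 2 for x, y in zip(a, b))
        numerator += overlap(sig_a, sig_b) * (dist ** 2 - squared_gap)
        denominator += overlap(sig_a, sig_a) * dist ** 2
    return numerator / denominator


# Hypothetical entries: (significance, standard distance, values).
record_a = [(1.0, 2.0, [0.0, 1.0]), (0.5, 2.0, [1.0, 1.0])]
record_b = [(1.0, 2.0, [0.0, 1.0]), (0.0, 2.0, [1.0, 1.0])]
print(correlithm_relationship_indicator(record_a, record_a))  # 1.0
print(correlithm_relationship_indicator(record_a, record_b))
```

The default `overlap=min` corresponds to the minimum option; passing `overlap=lambda x, y: x * y` selects the product option instead. Under either option a record compared with itself yields 1.0 in this sketch.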
49. (Original) The method of Claim 45, wherein: 

the second correlithm object comprises one or more first entries and the second 
significance vector comprises one or more second entries, at least one first entry comprising 
one of the first correlithm objects; and 

a number of first entries in the second correlithm object and a number of second 
entries in the second significance vector equal a number of unique tokens in the plurality of 
records. 

50. (Original) The method of Claim 49, wherein: 

each first entry in the second correlithm object is associated with one of the unique 
tokens; 

each first entry in the second correlithm object is also associated with one second 
entry in the second significance vector; and 

generating the second significance vector comprises: 

determining a number of times that the unique token associated with the first 
entry appears in the first record; 

determining a maximum number of times that the unique token associated 
with the first entry appears in a single record in the plurality of records; 

inserting one or more significance values from the first significance vector 
associated with the unique token into the second entry associated with the first entry when 
the unique token associated with the first entry appears the maximum number of times in the 
first record; 

inserting one or more scaled significance values from the first significance 
vector associated with the unique token into the second entry associated with the first entry 
when the unique token associated with the first entry appears at least once but less than the 
maximum number of times in the first record; and 

inserting one or more zero significance values into the second entry associated 
with the first entry when the unique token associated with the first entry does not appear in 
the first record. 
51. (Original) The method of Claim 33, wherein: 

at least one token comprises a first correlithm object; and 

at least one of the records comprises a second correlithm object, the second 
correlithm object comprising at least one of the first correlithm objects. 
52. (Previously Presented) Software for identifying relationships between 
database records, the software embodied on at least one computer readable medium and 
operable when executed to: 

determine a weight associated with each of a plurality of tokens, each token contained 
in at least one of a plurality of records, the plurality of records comprising a first record and 
at least one second record; 

generate a correlithm object associated with at least one of the tokens, the correlithm 
object comprising a plurality of values defining a first point in a particular space, the 
particular space defined by a plurality of dimensions and including a plurality of points; 

generate a significance vector associated with the correlithm object; 

compare at least one second record to the first record; and 

determine at least one relationship indicator based on the comparison and at least one 
of the weights, the at least one relationship indicator identifying a level of relationship 
between the first record and at least one second record. 
53. (Previously Presented) A system for identifying relationships between 
database records, comprising: 

means for storing a plurality of records comprising a first record and at least one 
second record, each record comprising at least one of a plurality of tokens; 

means for determining a weight associated with each of the tokens; 

means for generating a correlithm object associated with at least one of the tokens, 
the correlithm object comprising a plurality of values defining a first point in a particular 
space, the particular space defined by a plurality of dimensions and including a plurality of 
points; 

means for generating a significance vector associated with the correlithm object; 

means for comparing at least one second record to the first record; and 

means for determining at least one relationship indicator based on the comparison and 
at least one of the weights, the at least one relationship indicator identifying a level of 
relationship between the first record and at least one second record. 



54.-114. (Canceled) 
115. (Previously Presented) A method for identifying relationships between 
database records, comprising: 

communicating at least one of one or more documents, one or more text files, and one 
or more records to a server, each of the at least one of the documents, the text files, and the 
records comprising at least one of a plurality of tokens; and 

wherein the server is operable to: 

determine a weight associated with each of the tokens; 

generate a correlithm object associated with at least one of the tokens, the 
correlithm object comprising a plurality of values defining a first point in a particular space, 
the particular space defined by a plurality of dimensions and including a plurality of points; 

generate a significance vector associated with the correlithm object; 

compare two of the at least one of the documents, the text files and the 
records; and 

determine a relationship indicator based on the comparison and at least one of 
the weights, the at least one relationship indicator identifying a level of relationship between 
the two of the at least one of the documents, the text files and the records. 

116.-119. (Canceled) 